Inverse Bayesian Optimization: Learning Human Acquisition Functions in an Exploration vs Exploitation Search Task

نویسندگان

چکیده

This paper introduces a probabilistic framework to estimate parameters of an acquisition function given observed human behavior that can be modeled as collection sample paths from Bayesian optimization procedure. The methodology involves defining likelihood on task, where the is parameterized by subroutine governed unknown function. structure enables us make inference subject's while allowing their deviate around solution subroutine. To test our methods, we designed sequential task which forced subjects balance exploration and exploitation in search invisible target location. Applying proposed methods resulting data, find many tend exhibit preferences beyond standard functions capture. Guided model discrepancies, augment candidate yield superior fit this task.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Exploration vs Exploitation in Bayesian Optimization

The problem of optimizing unknown costly-to-evaluate functions has been studied for a long time in the context of Bayesian Optimization. Leveraging the Lipschitz property of the unknown function, we propose an algorithm with a distinct exploration phase followed by an exploitation phase. The exploration phase aims to select samples that shrink the search space as much as possible. The exploitat...

متن کامل

A Lipschitz Exploration-Exploitation Scheme for Bayesian Optimization

The problem of optimizing unknown costly-to-evaluate functions has been studied extensively in the context of Bayesian optimization. Algorithms in this field aim to find the optimizer of the function by requesting only a few function evaluations at carefully selected locations. An ideal algorithm should maintain a perfect balance between exploration (probing unexplored areas) and exploitation (...

متن کامل

Age differences in information search: An exploration-exploitation tradeoff model

Past research often found that older adults searched less in terms of browsing and generating keywords; few studies examined the processes and underlying mechanism that caused the age-related reduction on search. In the current study, 20 younger and older adults performed ill-defined search tasks with a search box we implemented. In addition to the age differences in the quantities of search, r...

متن کامل

Search in patchy media: Exploitation-exploration tradeoff.

How to best exploit patchy resources? We introduce a minimal exploitation-migration model that incorporates the coupling between a searcher's trajectory, modeled by a random walk, and ensuing depletion of the environment by the searcher's consumption of resources. The searcher also migrates to a new patch when it takes S consecutive steps without finding resources. We compute the distribution o...

متن کامل

Exploration vs Exploitation vs Safety: Risk-Aware Multi-Armed Bandits

Motivated by applications in energy management, this paper presents the Multi-Armed Risk-Aware Bandit (MaRaB) algorithm. With the goal of limiting the exploration of risky arms, MaRaB takes as arm quality its conditional value at risk. When the usersupplied risk level goes to 0, the arm quality tends toward the essential infimum of the arm distribution density, and MaRaB tends toward the MIN mu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bayesian Analysis

سال: 2023

ISSN: ['1936-0975', '1931-6690']

DOI: https://doi.org/10.1214/21-ba1303